Data center WAN aggregation to optimize hybrid cloud connectivity

ABSTRACT

An example method of optimizing connectivity between data centers in a hybrid cloud system having a first data center managed by a first organization and a second data center managed by a second organization, the first organization being a tenant in the second data center. The method includes probing a wide area network (WAN) with test packets by varying an internet protocol (IP) flow tuple of the test packets across a set of IP flows. The method includes identifying a plurality of paths between a gateway of the first data center and another gateway of the second data center associated with the set of IP flows. The method further includes selecting an IP flow from the set of IP flows for an application executing in the first data center. The method further includes establishing a path-optimized connection between the gateway and the other gateway through the WAN having the selected IP flow for use by the application.

BACKGROUND

Cloud architectures are used in cloud computing and cloud storage systems for offering infrastructure-as-a-service (IaaS) cloud services. Examples of cloud architectures include the VMware vCloud Director® cloud architecture software, Amazon EC2™ web service, and OpenStack™ open source cloud computing service. IaaS cloud service is a type of cloud service that provides access to physical and/or virtual resources in a cloud environment. These services provide a tenant application programming interface (API) that supports operations for manipulating IaaS constructs, such as virtual machines (VMs) and logical networks.

A hybrid cloud system aggregates the resource capability from both private and public clouds. A private cloud can include one or more customer data centers (referred to herein as “on-premise data centers”). The public cloud can include a multi-tenant cloud architecture providing IaaS cloud services. Typically, the customer data centers are connected to the cloud data centers through a wide area network (WAN) comprising multiple service provider backbone networks. As such, there can be multiple communication paths between customer data centers and cloud data centers. Given the many communication paths, it is desirable to optimize connectivity between customer data centers and cloud data centers in a hybrid cloud system.

SUMMARY

One or more embodiments provide techniques for optimizing connectivity between data centers in a hybrid cloud computing system. In an embodiment, a method of optimizing connectivity between data centers in a hybrid cloud system having a first data center managed by a first organization and a second data center managed by a second organization, the first organization being a tenant in the second data center. The method includes probing a wide area network (WAN) with test packets by varying an internet protocol (IP) flow tuple of the test packets across a set of IP flows. The method includes identifying a plurality of paths between a gateway of the first data center and another gateway of the second data center associated with the set of IP flows. The method further includes selecting an IP flow from the set of IP flows for an application executing in the first data center. The method further includes establishing a path-optimized connection between the gateway and the other gateway through the WAN having the selected IP flow for use by the application.

In another embodiment, a computer system includes a virtualized computing system, and a gateway coupled between the virtualized computing system and a wide area network (WAN). The gateway is configured to probe the WAN with test packets by varying an internet protocol (IP) flow tuple of the test packets across a set of IP flows. The gateway is further configured to identify a plurality of paths between the gateway and another gateway associated with the set of IP flows. The gateway is further configured to select an IP flow from the set of IP flows for an application executing in the virtualized computing system. The gateway is further configured to establish a path-optimized connection between the gateway and the other gateway through the WAN having the selected IP flow for use by the application.

Further embodiments include a non-transitory computer-readable storage medium comprising instructions that cause a computer system to carry out the above method.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a hybrid cloud computing system in which one or more embodiments of the present disclosure may be utilized.

FIG. 2 is a block diagram of a portion of a hybrid cloud computing system in which one or more embodiments of the present disclosure may be utilized.

FIG. 3 is a block diagram depicting a logical view of hybrid cloud computing system of FIG. 2 according to embodiments.

FIG. 4 is a flow diagram depicting a method of identifying and classifying paths in a wide area network (WAN) according to embodiments.

FIG. 5 illustrates an example database that can be maintained by a gateway for identifying and classifying paths in a WAN according to embodiments.

FIG. 6 is a flow diagram depicting a method of optimizing connectivity between data centers in a hybrid cloud computing system according to embodiments.

FIG. 7 is a block diagram depicting an example of a computer system in which one or more embodiments of the present disclosure may be utilized.

To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the figures. It is contemplated that elements disclosed in one embodiment may be beneficially utilized on other embodiments without specific recitation.

DETAILED DESCRIPTION

FIG. 1 is a block diagram of a hybrid cloud computing system 10 in which one or more embodiments of the present disclosure may be utilized. Hybrid cloud computing system 10 includes a plurality of virtualized computing systems implemented within on-premise data centers and a cloud computing system 11. In the example of FIG. 1, hybrid cloud computing system 10 includes on-premise data centers 12-1, 12-2, 14, 16, 18, and 20, each of which is communicatively coupled to cloud computing system 11. In the example of FIG. 1, cloud computing system 11 includes cloud data centers 11-1, 11-2, and 11-3. The number of cloud data centers and the number of on-premise data centers shown in FIG. 1 is just one example. In general, there can be any number of on-premise data centers communicatively coupled to cloud computing system 11, which can include any number of cloud data centers.

Hybrid cloud computing system 100 is configured to provide a common platform for managing and executing virtual workloads seamlessly between on-premise data centers and cloud data centers. In one embodiment, an on-premise data center may be a data center controlled and administrated by a particular enterprise or business organization, while cloud data centers of cloud computing system 11 may be operated by a cloud computing service provider and exposed as a service available to account holders, such as the particular enterprise in addition to other enterprises. As such, on-premise data center(s) of an enterprise may sometimes be referred to as a “private” cloud, and cloud computing system 11 may be referred to as a “public” cloud.

As used herein, an internal cloud or “private” cloud is a cloud in which a tenant and a cloud service provider are part of the same organization, while an external or “public” cloud is a cloud that is provided by an organization that is separate from a tenant that accesses the external cloud. For example, the tenant may be part of an enterprise, and the external cloud may be part of a cloud service provider that is separate from the enterprise of the tenant and that provides cloud services to different enterprises and/or individuals. In embodiments disclosed herein, a hybrid cloud is a cloud architecture in which a tenant is provided with seamless access to both private cloud resources and public cloud resources.

In the example of FIG. 1, on-premise data centers 12-1, 14, and 16 are communicatively coupled to cloud data center 11-1, and on-premise data centers 12-2, 18, and 20 are communicatively coupled to cloud data center 11-2. A given on-premise data center can be coupled to one or more cloud data centers through one or more network connections, including direct network connections (e.g., private connections) and/or public network connections (e.g., public Internet connections). On-premise data centers 12-1 and 12-2 can be controlled and administered by the same enterprise, whereas on-premise data centers 14, 16, 18, and 20 can be controlled and administered by separate enterprises. In some embodiments, on-premise data center 12-1 can be communicatively coupled to on-premise data center 12-2 through one or more network connections. That is, a given enterprise's private cloud can include a plurality of on-premise data centers with network connectivity therebetween. Likewise, cloud data centers 11-1, 11-2, and 11-3 can include one or more network connections to support network connectivity therebetween.

In some embodiments, cloud data centers 11-1, 11-2, and 11-3 are located to support particular geographic regions. Thus, on-premise data centers 12-1, 14, and 16 can be located in one geographic region served by cloud data center 11-1. On-premise data centers 12-2, 18, and 20 can be located in another geographic region served by cloud data center 11-2. Cloud data center 11-3 can serve yet another geographic region having one or more on-premise data centers (not shown).

FIG. 2 is a block diagram of a hybrid cloud computing system 100 in which one or more embodiments of the present disclosure may be utilized. Hybrid cloud computing system 100 includes a virtualized computing system implementing an on-premise data center 102 and a virtualized computing system implementing a cloud data center 150. Hybrid cloud computing system 100 is a subset of hybrid cloud computing system 10 with one on-premise data center and one cloud data center. It is to be understood that each on-premise data center coupled to a cloud computing system can be configured similarly to on-premise data center 102, and each cloud data center that is part of a cloud computing system can be configured similarly to cloud data center 150. In this regard, on-premise data center 102 can be one of on-premise data centers 12-1, 12-2, 14, 16, 18, 20, and cloud data center 150 can be one of cloud data centers 11-1, 11-2, and 11-3 shown in FIG. 1.

On-premise data center 102 includes one or more host computer systems (“hosts 104”). Hosts 104 may be constructed on a server grade hardware platform 106, such as an x86 architecture platform. As shown, hardware platform 106 of each host 104 may include conventional components of a computing device, such as one or more processors (CPUs) 108, system memory 110, a network interface 112, storage system 114, and other I/O devices such as, for example, a mouse and keyboard (not shown). CPU 108 is configured to execute instructions, for example, executable instructions that perform one or more operations described herein and may be stored in memory 110 and in local storage. Memory 110 is a device allowing information, such as executable instructions, cryptographic keys, virtual disks, configurations, and other data, to be stored and retrieved. Memory 110 may include, for example, one or more random access memory (RAM) modules. Network interface 112 enables host 104 to communicate with another device via a communication medium, such as a network 122 within on-premise data center 102. Network interface 112 may be one or more network adapters, also referred to as a Network Interface Card (NIC). Storage system 114 represents local storage devices (e.g., one or more hard disks, flash memory modules, solid state disks, and optical disks) and/or a storage interface that enables host 104 to communicate with one or more network data storage systems. Examples of a storage interface are a host bus adapter (HBA) that couples host 104 to one or more storage arrays, such as a storage area network (SAN) or a network-attached storage (NAS), as well as other network data storage systems.

Each host 104 is configured to provide a virtualization layer that abstracts processor, memory, storage, and networking resources of hardware platform 106 into multiple virtual machines 120 ₁ to 120 _(N) (collectively referred to as VMs 120) that run concurrently on the same hosts. VMs 120 run on top of a software interface layer, referred to herein as a hypervisor 116, that enables sharing of the hardware resources of host 104 by VMs 120. One example of hypervisor 116 that may be used in an embodiment described herein is a VMware ESXi™ hypervisor provided as part of the VMware vSphere® solution made commercially available from VMware, Inc. of Palo Alto, Calif. Hypervisor 116 may run on top of the operating system of host 104 or directly on hardware components of host 104.

On-premise data center 102 includes a virtualization management component (depicted in FIG. 2 as virtualization manager 130) that may communicate to the plurality of hosts 104 via a network, sometimes referred to as a management network 126. In one embodiment, virtualization manager 130 is a computer program that resides and executes in a central server, which may reside in on-premise data center 102, or alternatively, running as a VM in one of hosts 104. One example of a virtualization manager is the vCenter Server™ product made available from VMware, Inc. Virtualization manager 130 is configured to carry out administrative tasks for computing system 102, including managing hosts 104, managing VMs 120 running within each host 104, provisioning VMs, migrating VMs from one host to another host, and load balancing between hosts 104.

In one embodiment, virtualization manager 130 includes a hybrid cloud management module (depicted as hybrid cloud manager 132) configured to manage and integrate virtualized computing resources provided by cloud computing system 150 with virtualized computing resources of computing system 102 to form a unified “hybrid” computing platform. Hybrid cloud manager 132 is configured to deploy VMs in cloud computing system 150, transfer VMs from virtualized computing system 102 to cloud computing system 150, and perform other “cross-cloud” administrative tasks. In one implementation, hybrid cloud manager 132 is a module or plug-in complement to virtualization manager 130, although other implementations may be used, such as a separate computer program executing in a central server or running in a VM in one of hosts 104.

In one embodiment, hybrid cloud manager 132 is configured to control network traffic into network 122 via a gateway component (depicted as a gateway 124). Gateway 124 (e.g., executing as a virtual appliance) is configured to provide VMs 120 and other components in on-premise data center 102 with connectivity to an external wide area network (WAN) 140 (e.g., the public Internet). Gateway 124 may manage external public IP addresses for VMs 120 and route traffic incoming to and outgoing from on-premise data center 102 and provide networking services, such as firewalls, network address translation (NAT), dynamic host configuration protocol (DHCP), load balancing, and virtual private network (VPN) connectivity over WAN 140. As described further herein, gateway 124 can optimize connectivity between on-premise data center 102 and cloud data center 150 through WAN 140.

In one or more embodiments, cloud data center 150 is configured to dynamically provide an enterprise (or users of an enterprise) with one or more virtual data centers 180 in which a user may provision VMs 120, deploy multi-tier applications on VMs 120, and/or execute workloads. Cloud data center 150 includes an infrastructure platform 154 upon which a cloud computing environment 170 may be executed. In the particular embodiment of FIG. 2, infrastructure platform 154 includes hardware resources 160 having computing resources (e.g., hosts 162 ₁ to 162 _(N)), storage resources (e.g., one or more storage array systems, such as SAN 164), and networking resources, which are configured in a manner to provide a virtualization environment 156 that supports the execution of a plurality of virtual machines 172 across hosts 162. It is recognized that hardware resources 160 of cloud computing system 150 may in fact be distributed across multiple data centers in different locations.

Each cloud computing environment 170 is associated with a particular tenant of cloud computing system 150, such as the enterprise providing on-premise data center 102. In one embodiment, cloud computing environment 170 may be configured as a dedicated cloud service for a single tenant comprised of dedicated hardware resources 160 (i.e., physically isolated from hardware resources used by other users of cloud computing system 150). In other embodiments, cloud computing environment 170 may be configured as part of a multi-tenant cloud service with logically isolated virtualized computing resources on a shared physical infrastructure. As shown in FIG. 2, cloud data center 150 may support multiple cloud computing environments 170, available to multiple enterprises in single-tenant and multi-tenant configurations.

In one embodiment, virtualization environment 156 includes an orchestration component 158 (e.g., implemented as a process running in a VM) that provides infrastructure resources to cloud computing environment 170 responsive to provisioning requests. For example, if an enterprise required a specified number of virtual machines to deploy a web applications or to modify (e.g., scale) a currently running web application to support peak demands, orchestration component 158 can initiate and manage the instantiation of virtual machines (e.g., VMs 172) on hosts 162 to support such requests. In one embodiment, orchestration component 158 instantiates virtual machines according to a requested template that defines one or more virtual machines having specified virtual computing resources (e.g., compute, networking, storage resources). Further, orchestration component 158 monitors the infrastructure resource consumption levels and requirements of cloud computing environment 170 and provides additional infrastructure resources to cloud computing environment 170 as needed or desired. In one example, similar to on-premise data center 102, virtualization environment 156 may be implemented by running on hosts 162 VMware ESXi™-based hypervisor technologies provided by VMware, Inc. (although it should be recognized that any other virtualization technologies, including Xen® and Microsoft Hyper-V® virtualization technologies may be utilized consistent with the teachings herein).

In one embodiment, cloud data center 150 may include a cloud director 152 (e.g., run in one or more virtual machines) that manages allocation of virtual computing resources to an enterprise for deploying applications. Cloud director 152 may be accessible to users via a REST (Representational State Transfer) API (Application Programming Interface) or any other client-server communication protocol. Cloud director 152 may authenticate connection attempts from the enterprise using credentials issued by the cloud computing provider. Cloud director 152 maintains and publishes a catalog 166 of available virtual machine templates and packaged virtual machine applications that represent virtual machines that may be provisioned in cloud computing environment 170. A virtual machine template is a virtual machine image that is loaded with a pre-installed guest operating system, applications, and data, and is typically used to repeatedly create a VM having the pre-defined configuration. A packaged virtual machine application is a logical container of pre-configured virtual machines having software components and parameters that define operational details of the packaged application. An example of a packaged VM application is vApp technology made available by VMware, Inc., although other technologies may be utilized. Cloud director 152 receives provisioning requests submitted (e.g., via REST API calls) and may propagates such requests to orchestration component 158 to instantiate the requested virtual machines (e.g., VMs 172). One example of cloud director 152 is the VMware vCloud Director® produced by VMware, Inc.

In the embodiment of FIG. 2, cloud computing environment 170 supports the creation of a virtual data center 180 having a plurality of virtual machines 172 instantiated to, for example, host deployed multi-tier applications. A virtual data center 180 is a logical construct that provides compute, network, and storage resources to an organization. Virtual data centers 180 provide an environment where VM 172 can be created, stored, and operated, enabling complete abstraction between the consumption of infrastructure service and underlying resources. VMs 172 may be configured similarly to VMs 120, as abstractions of processor, memory, storage, and networking resources of hardware resources 160.

Virtual data center 180 includes one or more virtual networks 182 used to communicate between VMs 172 and managed by at least one networking gateway component (e.g., gateway 184), as well as one or more isolated internal networks 186 not connected to gateway 184. Gateway 184 (e.g., executing as a virtual appliance) is configured to provide VMs 172 and other components in cloud computing environment 170 with connectivity to WAN 140 (e.g., the public Internet). Gateway 184 manages external public IP addresses for virtual data center 180 and one or more private internal networks interconnecting VMs 172. Gateway 184 is configured to route traffic incoming to and outgoing from virtual data center 180 and provide networking services, such as firewalls, network address translation (NAT), dynamic host configuration protocol (DHCP), and load balancing. Gateway 184 may be configured to provide virtual private network (VPN) connectivity over WAN 140 with another VPN endpoint, such as gateway 124 within on-premise data center 102. In other embodiments, gateway 184 may be configured to connect to communicate with on-premise data center 102 using a high-throughput, dedicated link (depicted as a direct connect 142) between on-premise data center 102 and cloud computing system 150. In one or more embodiments, gateways 124 and 184 are configured to provide a “stretched” layer-2 (L2) network that spans on-premise data center 102 and virtual data center 180, as shown in FIG. 2.

While FIG. 2 depicts communication between on-premise gateway 124 and cloud-side gateway 184 for illustration purposes, it should be recognized that communication between multiple on-premise gateways 124 and cloud-side gateways 184 may be used. Furthermore, while FIG. 2 depicts a single instance of a gateway 184, it is recognized that gateway 184 may represent multiple gateway components within cloud data center 150. In some embodiments, a separate gateway 184 may be deployed for each virtual data center, or alternatively, for each tenant. In some embodiments, a gateway instance may be deployed that manages traffic with a specific tenant, while a separate gateway instance manages public-facing traffic to the Internet. In yet other embodiments, one or more gateway instances that are shared among all the tenants of cloud data center 150 may be used to manage all public-facing traffic incoming and outgoing from cloud data center 150.

In one embodiment, each virtual data center 180 includes a “hybridity” director module (depicted as hybridity director 174) configured to communicate with the corresponding hybrid cloud manager 132 in on-premise data center 102 to enable a common virtualized computing platform between on-premise data center 102 and cloud data center 150. Hybridity director 174 (e.g., executing as a virtual appliance) may communicate with hybrid cloud manager 132 using Internet-based traffic via a VPN tunnel established between gateways 124 and 184, or alternatively, using direct connection 142. In one embodiment, hybridity director 174 may control gateway 184 to control network traffic into virtual data center 180. In some embodiments, hybridity director 174 may control VMs 172 and hosts 162 of cloud data center 150 via infrastructure platform 154.

FIG. 3 is a block diagram depicting a logical view of hybrid cloud computing system 100 according to embodiments. Various applications 302 execute within on-premise data center 102 and are configured for communication with on-premise gateway 124 to obtain access to WAN 140. Applications 302 can include any software application, process, thread, or the like executing on a computer (e.g., virtual or physical) within on-premise data center 102. Likewise, various applications 320 execute within cloud data center 150 and are configured for communication with cloud gateway 184 to obtain access to WAN 140. Applications can include any software application, process, thread, or the like executing on a computer (e.g., virtual or physical) within cloud data center 150.

Some applications 302 in on-premise data center 102 can cooperate with other applications 320 in cloud data center 150. As such, some applications 302 can communicate with other applications 320 through WAN 140. For example, a VM migration process executing within on-premise data center 102 can cooperate with a VM migration process executing within cloud data center 150 to migrate a VM from on-premise data center 102 to cloud data center 150 over WAN 140. VM migration is merely one example of a myriad of applications designed to cooperate through communication over WAN 140. To initiate communication, an application 302 can communicate with on-premise gateway 124 to establish a connection through WAN 140 between on-premise gateway 124 and cloud gateway 184. Alternatively, an application 302 can communicate with cloud gateway 184 to establish a connection through WAN 140 between on-premise gateway 124 and cloud gateway 184.

WAN 140 includes a plurality of communication nodes. Each communication node can include one or more network devices, such as routers, switches, and the like. Different sets of communication nodes can be managed by different service providers, such as network service providers (NSPs), Internet service providers (ISPs), and the like. In the example of FIG. 3, WAN 140 includes communication nodes 304 through 318. Communication nodes 306, 308, and 310 are controlled by a service provider 322. Communication nodes 314 and 316 are controlled by a service provider 324. Communication node 312 is controlled by a service provider 326. Communication node 304 comprises an edge node coupled to on-premise gateway 124 and can be controlled by a service provider or by the enterprise that controls on-premise data center 102. Communication node 318 comprises an edge node coupled to cloud gateway 184 and can be controlled by a service provider or by the cloud service provider that controls cloud data center 150.

Service providers 322, 324, and 326 are typically third parties with respect to the enterprise controlling on-premise data center 102 and the cloud service provider controlling cloud data center 150. As such, neither the enterprise nor the cloud service provider has control over the communication nodes in WAN 140, other than potentially the edge nodes 304 and 318. As such, neither the enterprise nor the cloud service provider can control the path through WAN 140 for a connection between on-premise gateway 124 and cloud gateway 184. Service providers 322, 324, and 326 can implement one or more traffic management schemes to control traffic flow through their communication nodes. Example traffic management schemes include traffic shaping, traffic policing, and the like. Some traffic management schemes are content-based and can manage traffic according to the different applications that generate the traffic. Other traffic management schemes are route-based and can manage traffic according to different Internet Protocol (IP) flows. An IP flow is defined by an IP flow tuple of source IP address, source port, destination IP address, and destination port. The traffic management schemes implemented by service providers 322, 324, and 326 can affect the performance (e.g., latency, data rate, etc.) of connections between on-premise gateway 124 and cloud gateway 184. In some cases, the performance of an arbitrary connection through WAN 140 can less than that required by a given application. For example, a VM migration process can time-out or otherwise fail if the latency of a connection exceeds a particular threshold. While packet encryption (e.g., VPN) can be used to avoid content-based traffic management, such encryption does not avoid route-based traffic management based on IP flow.

In embodiments, gateways 124 and 184 are configured to optimize connectivity through WAN 140. FIG. 4 is a flow diagram depicting a method 400 of identifying and classifying paths in WAN 140 according to embodiments. Method 400 can be performed by a gateway, such as on-premise gateway 124 or cloud gateway 184. For purposes of clarity by example, method 400 is described as being performed by on-premise gateway 124.

Method 400 begins at step 402, where on-premise gateway 124 probes WAN 140 to identify paths between on-premise data center 102 and cloud data center 150. At any given time, WAN 140 can route packets (generally referred to as traffic) between on-premise gateway 124 and cloud gateway 184 through different sets of communication nodes. A path through WAN 140 includes a particular set of communication nodes. In an embodiment, on-premise gateway 124 can send and receive test traffic (test packets) to and from cloud gateway 184 to identify different paths. The test traffic can include different IP flows in an attempt to identify different paths through WAN 140. As discussed above, an IP flow is defined by an IP flow tuple. At step 408, on-premise gateway 124 can vary the IP flow tuple of the test traffic sent between on-premise gateway 1224 and cloud gateway 184 over WAN 140. In some embodiments, one or more of the source IP address, source port, destination IP address, and destination port can be varied for the test traffic. At a given time, one IP flow can cause traffic to flow through one path, and another IP flow can cause traffic to flow through another path. Traffic management schemes within WAN 140 are generally controlled by a network provider and may be outside the control of the organizations managing the on-premise and cloud data centers. For example, one traffic management scheme within WAN 140 may shape traffic based on port numbers used in the flow. In this example, on-premise gateway 124 varies the port numbers (source and/or destination) of the test traffic, which cause different routing paths to form within WAN 140 (as a result of the traffic management schemes). A set of IP flows can be tested by varying the IP flow tuple of the test traffic to identify a set of paths through WAN 140. Each path in the resulting set of paths can be associated with one or more IP flows. Conversely, each IP flow in the set of tested IP flows can be associated with one or more paths. At step 409, one or more performance metrics can be determined for each resulting path. Example performance metrics include latency and data rate.

In the example of FIG. 3, there is a path comprising nodes 304, 306, 308, and 318; another path comprising nodes 304, 310, and 318; another path comprising nodes 304, 310, 314, 316, and 318; and another path comprising nodes 304, 306, 308, 312, and 318. Each of the paths can exhibit different performance (e.g., different latency, different data rates, etc.). On-premise gateway 124 can send and receive test traffic using a set of IP flows to be tested. One or more of the IP flows can result in the test traffic traversing the path comprising nodes 304, 306, 308, and 318. One or more other of the IP flows can result in the test traffic traversing the path comprising nodes 304, 310, and 318. Other IP flows can result in the test traffic traversing the other paths. In this manner, different paths through WAN 140 can be identified and associated with the corresponding IP flow tuples.

At step 404, on-premise gateway 124 classifies the IP flows based on performance calculated from the determined performance metric(s) for the resulting paths. Performance metrics for one or more associated paths can be combined in various ways to compute an overall performance of a given IP flow. Thus, some IP flow can be classified as having higher performance, while other IP flows can be classified as having lower performance.

At optional step 406, on-premise gateway 124 can map different policies to the IP flows based on the calculated performance. Each policy can specify a certain level of performance. The policies can then be assigned to different types of application traffic either automatically by on-premise gateway 124, or specifically by an administrator. To facilitate automatic policy assignment, each policy can specify one or more constraints that need to be met before application traffic can be assigned that policy. The constraints can be based on various attributes, such as application traffic type, time of day, and the like.

FIG. 5 illustrates an example database 500 that can be maintained by on-premise gateway 124 according to embodiments. Database 500 includes a list of IP flows. For each IP flow, database 500 includes a performance associated with that IP flow. Database 500 can optionally include other information for each IP flow, such as which policies are mapped to each IP flow, which paths resulted from each IP flow, and the like. On-premise gateway 124 can repeatedly perform method 400 to maintain and update database 500 over time. Method 400 can be performed by any other gateway in a hybrid cloud computing system in a similar fashion.

FIG. 6 is a flow diagram depicting a method 600 of optimizing connectivity between data centers in a hybrid cloud computing system according to embodiments. Method 600 is described as being performed by on-premise gateway 124, but can be performed by any other gateway within hybrid cloud computing system.

Method 600 begins at step 602, where on-premise gateway 124 identifies and classifies paths in WAN. For example, on-premise gateway 124 can perform method 400 described above to maintain database 500 described above. At step 604, on-premise gateway 124 selects an IP flow for application traffic originating from an application 302. For example, at step 608, on-premise gateway 124 can determine a policy for the application traffic and select an IP flow based on the determined policy. As noted above, an administrator can assign a policy to particular types of application traffic, or on-premise gateway 124 can automatically assign a policy to the application traffic. Alternatively, at step 610, on-premise gateway 124 can determine performance requirements of the application traffic and select an IP flow based on performance. That is, rather than using policies, application traffic can be assigned to a particular IP flow based on performance requirements.

At step 606, on-premise gateway 124 establishes a path-optimized connection between to cloud gateway 184 through WAN 140. A path-optimized connection is a connection selected for the application traffic based on performance or policy, as described above. Step 606 can include various sub-steps. At step 612, on-premise gateway 124 can establish a secure channel with cloud gateway 184 (e.g., a VPN connection). When establishing the secure channel, on-premise gateway 124 can communicate with cloud gateway 184 through WAN 140. On-premise gateway 124 can inform cloud gateway 184 of the IP flow to be used for the secure channel.

At step 614, on-premise gateway 124 can encapsulate the application traffic within path-optimized traffic having an IP flow tuple associated with the selected IP flow. At step 616, on-premise gateway 124 encrypts the path-optimized traffic in accordance with the parameters of the established secure channel. At step 618, on-premise gateway 124 transmits the path-optimized traffic to cloud gateway 184 over the secure channel. At step 620, on-premise gateway 124 receives path-optimized traffic from cloud gateway 184 over the secure channel. At step 622, on-premise gateway 124 decrypts the path-optimized traffic and decapsulates the path-optimized traffic obtain application traffic.

FIG. 7 is a block diagram depicting an example of a computer system 700 in which one or more embodiments of the present disclosure may be utilized. Computer system 700 can be used as a host to implement on-premise gateway 124, cloud gateway 184, or other gateway in a hybrid cloud computing system. Computer system 700 includes one or more central processing units (CPUs) 702, memory 704, input/output (IO) circuits 706, and various support circuits 708. Each of CPUs 702 can include any microprocessor known in the art and can execute instructions stored on computer readable storage, such as memory 704. Memory 704 can include various volatile and/or non-volatile memory devices, such as random access memory (RAM), read only memory (ROM), and the like. Instructions and data 710 for performing the various methods and techniques described above can be stored in memory 704 for execution by CPUs 702. That is, memory 704 can store instructions executable by CPUs 702 to perform one or more steps/sub-steps described above in FIGS. 4 and 6. Support circuits 708 include various circuits used to support operation of a computer system as known in the art.

The various embodiments described herein may employ various computer-implemented operations involving data stored in computer systems. For example, these operations may require physical manipulation of physical quantities—usually, though not necessarily, these quantities may take the form of electrical or magnetic signals, where they or representations of them are capable of being stored, transferred, combined, compared, or otherwise manipulated. Further, such manipulations are often referred to in terms, such as producing, identifying, determining, or comparing. Any operations described herein that form part of one or more embodiments of the invention may be useful machine operations. In addition, one or more embodiments of the invention also relate to a device or an apparatus for performing these operations. The apparatus may be specially constructed for specific required purposes, or it may be a general purpose computer selectively activated or configured by a computer program stored in the computer. In particular, various general purpose machines may be used with computer programs written in accordance with the teachings herein, or it may be more convenient to construct a more specialized apparatus to perform the required operations.

The various embodiments described herein may be practiced with other computer system configurations including hand-held devices, microprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, and the like.

One or more embodiments of the present invention may be implemented as one or more computer programs or as one or more computer program modules embodied in one or more computer readable media. The term computer readable medium refers to any data storage device that can store data which can thereafter be input to a computer system—computer readable media may be based on any existing or subsequently developed technology for embodying computer programs in a manner that enables them to be read by a computer. Examples of a computer readable medium include a hard drive, network attached storage (NAS), read-only memory, random-access memory (e.g., a flash memory device), a CD (Compact Discs)—CD-ROM, a CD-R, or a CD-RW, a DVD (Digital Versatile Disc), a magnetic tape, and other optical and non-optical data storage devices. The computer readable medium can also be distributed over a network coupled computer system so that the computer readable code is stored and executed in a distributed fashion.

Although one or more embodiments of the present invention have been described in some detail for clarity of understanding, it will be apparent that certain changes and modifications may be made within the scope of the claims. Accordingly, the described embodiments are to be considered as illustrative and not restrictive, and the scope of the claims is not to be limited to details given herein, but may be modified within the scope and equivalents of the claims. In the claims, elements and/or steps do not imply any particular order of operation, unless explicitly stated in the claims.

Virtualization systems in accordance with the various embodiments may be implemented as hosted embodiments, non-hosted embodiments or as embodiments that tend to blur distinctions between the two, are all envisioned. Furthermore, various virtualization operations may be wholly or partially implemented in hardware. For example, a hardware implementation may employ a look-up table for modification of storage access requests to secure non-disk data.

Certain embodiments as described above involve a hardware abstraction layer on top of a host computer. The hardware abstraction layer allows multiple contexts to share the hardware resource. In one embodiment, these contexts are isolated from each other, each having at least a user application running therein. The hardware abstraction layer thus provides benefits of resource isolation and allocation among the contexts. In the foregoing embodiments, virtual machines are used as an example for the contexts and hypervisors as an example for the hardware abstraction layer. As described above, each virtual machine includes a guest operating system in which at least one application runs. It should be noted that these embodiments may also apply to other examples of contexts, such as containers not including a guest operating system, referred to herein as “OS-less containers” (see, e.g., www.docker.com). OS-less containers implement operating system—level virtualization, wherein an abstraction layer is provided on top of the kernel of an operating system on a host computer. The abstraction layer supports multiple OS-less containers each including an application and its dependencies. Each OS-less container runs as an isolated process in userspace on the host operating system and shares the kernel with other containers. The OS-less container relies on the kernel's functionality to make use of resource isolation (CPU, memory, block I/O, network, etc.) and separate namespaces and to completely isolate the application's view of the operating environments. By using OS-less containers, resources can be isolated, services restricted, and processes provisioned to have a private view of the operating system with their own process ID space, file system structure, and network interfaces. Multiple containers can share the same kernel, but each container can be constrained to only use a defined amount of resources such as CPU, memory and I/O. The term “virtualized computing instance” as used herein is meant to encompass both VMs and OS-less containers.

Many variations, modifications, additions, and improvements are possible, regardless the degree of virtualization. The virtualization software can therefore include components of a host, console, or guest operating system that performs virtualization functions. Plural instances may be provided for components, operations or structures described herein as a single instance. Boundaries between various components, operations and data stores are somewhat arbitrary, and particular operations are illustrated in the context of specific illustrative configurations. Other allocations of functionality are envisioned and may fall within the scope of the invention(s). In general, structures and functionality presented as separate components in exemplary configurations may be implemented as a combined structure or component. Similarly, structures and functionality presented as a single component may be implemented as separate components. These and other variations, modifications, additions, and improvements may fall within the scope of the appended claim(s). 

We claim:
 1. A method of optimizing connectivity between data centers in a hybrid cloud system having a first data center managed by a first organization and a second data center managed by a second organization, the first organization being a tenant in the second data center, the method, comprising: using a gateway of the first data center, probing a wide area network (WAN) with test packets between the gateway of the first data center to another gateway of the second data center through communication nodes of the WAN that are managed by at least one third-party service provider with respect to the first and second organizations by varying an Internet Protocol (IP) flow tuple of the test packets across a set of IP flows, wherein at least one route-based traffic management scheme is implemented for the WAN by the at least one third-party service provider to control traffic flow through at least some of the communication nodes of the WAN, wherein neither the first organization nor the second organization has control over the at least some of the communication nodes of the WAN, and wherein varying the IP flow tuple of the test packets across the set of IP flows comprises varying source and destination port numbers of the test packets to cause a plurality of paths to form within the WAN as a result of the at least one route-based traffic management scheme; using the gateway of the first data center, identifying the paths through the communication nodes of the WAN between the gateway of the first data center and the another gateway of the second data center associated with the set of IP flows; using the gateway of the first data center, classifying each IP flow in the set of IP flows based on performance calculated by combining at least latency and data rate associated with one or more of the plurality of paths; using the gateway of the first data center, mapping a plurality of policies to the set of IP flows based on calculated performance of each of the set of IP flows, wherein the policies are automatically assigned to different types of application traffic by the gateway of the first data center, wherein each of the policies specifies one or more constraints to be met before an application traffic is assigned that policy, wherein the one or more constraints are based on application traffic type and time of day; using the gateway of the first data center, selecting an IP flow from the set of IP flows for the application such that the performance of the selected IP flow that is calculated from at least the latency and the data rate associated with the one or more of the plurality of paths satisfies the application-specific performance requirements of the application; and using the gateway of the first data center, establishing a path-optimized connection between the gateway and the other gateway through at least one of the communication nodes of the WAN having the selected IP flow for use by the application.
 2. The method of claim 1, wherein the WAN includes a public Internet having networks of a plurality of service providers.
 3. The method of claim 2, wherein at least one of the networks is configured with at least one traffic management scheme.
 4. The method of claim 1, wherein the step of selecting the IP flow comprises: receiving a policy of the plurality of policies designated for the application; wherein the selected IP flow is mapped to the received policy.
 5. The method of claim 1, wherein the step of establishing the path-optimized connection comprises: establishing a secure channel between the gateway and the other gateway; encapsulating application packets from the application within path-optimized packets according to the selected IP flow; and encrypting the path-optimized packets for transmission over the secure channel.
 6. The method of claim 5, wherein the step of establishing the secure channel comprises sending an IP flow tuple for the selected IP flow from the gateway to the other gateway.
 7. The method of claim 5, wherein the step of establishing the path-optimized connection comprises: receiving path-optimized packets from the other gateway over the secure channel; decrypting the received path-optimized packets; and decapsulating the received path-optimized packets based on the selected IP flow.
 8. A computer system, comprising: a virtualized computing system; and a gateway coupled between the virtualized computing system and a wide area network (WAN), the gateway configured to: probe the WAN with test packets between the gateway of a first data center managed by a first organization to another gateway of a second data center managed by a second organization through communication nodes of the WAN that are managed by at least one third-party service provider with respect to the computer system by varying an internet protocol (IP) flow tuple of the test packets across a set of IP flows, wherein at least one route-based traffic management scheme is implemented for the WAN by the at least one third-party service provider to control traffic flow through at least some of the communication nodes of the WAN, wherein neither the first organization nor the second organization has control over the at least some of the communication nodes of the WAN, and wherein varying the IP flow tuple of the test packets across the set of IP flows comprises varying source and destination port numbers of the test packets to cause a plurality of paths to form within the WAN as a result of the at least one route-based traffic management scheme; identify the paths through the communication nodes of the WAN between the gateway and the another gateway associated with the set of IP flows; classify each IP flow in the set of IP flows based on performance calculated by combining at least latency and data rate associated with one or more of the plurality of paths; map a plurality of policies to the set of IP flows based on calculated performance of each of the set of IP flows, wherein the policies are automatically assigned to different types of application traffic by the gateway of the first data center, wherein each of the policies specifies one or more constraints to be met before an application traffic is assigned that policy, wherein the one or more constraints are based on application traffic type and time of day; select an IP flow from the set of IP flows for the application such that the performance of the selected IP flow that is calculated from at least the latency and the data rate associated with the one or more of the plurality of paths satisfies the application-specific performance requirements of the application; and establish a path-optimized connection between the gateway and the other gateway through at least one of the communication nodes of the WAN having the selected IP flow for use by the application.
 9. The computer system of claim 8, wherein the WAN includes a public Internet having networks of a plurality of service providers.
 10. The computer system of claim 9, wherein at least one of the networks is configured with at least one traffic management scheme.
 11. The computer system of claim 8, wherein the gateway is configured to: receive a policy of the plurality of policies designated for the application; wherein the selected IP flow is mapped to the received policy.
 12. The computer system of claim 8, wherein the gateway is configured to: establish a secure channel to the other gateway; encapsulate application packets from the application within path-optimized packets according to the selected IP flow; and encrypt the path-optimized packets for transmission over the secure channel.
 13. A non-transitory computer readable medium comprising instructions, which when executed in a computer system, causes the computer system to carry out a method of optimizing connectivity between data centers in a hybrid cloud system having a first data center managed by a first organization and a second data center managed by a second organization, the first organization being a tenant in the second data center, the method comprising: using a gateway of the first data center, probing a wide area network (WAN) with test packets between the gateway of the first data center to another gateway of the second data center through communication nodes of the WAN that are managed by at least one third-party service provider with respect to the first and second organizations by varying an Internet Protocol (IP) flow tuple of the test packets across a set of IP flows, wherein at least one route-based traffic management scheme is implemented for the WAN by the at least one third-party service provider to control traffic flow through at least some of the communication nodes of the WAN, wherein neither the first organization nor the second organization has control over the at least some of the communication nodes of the WAN, and wherein varying the IP flow tuple of the test packets across the set of IP flows comprises varying source and destination port numbers of the test packets to cause a plurality of paths to form within the WAN as a result of the at least one route-based traffic management scheme; using the gateway of the first data center, identifying the paths through the communication nodes of the WAN between the gateway of the first data center and the another gateway of the second data center associated with the set of IP flows; using the gateway of the first data center, classifying each IP flow in the set of IP flows based on performance calculated by combining at least latency and data rate associated with one or more of the plurality of paths; using the gateway of the first data center, mapping a plurality of policies to the set of IP flows based on calculated performance of each of the set of IP flows, wherein the policies are automatically assigned to different types of application traffic by the gateway of the first data center, wherein each of the policies specifies one or more constraints to be met before an application traffic is assigned that policy, wherein the one or more constraints are based on application traffic type and time of day; using the gateway of the first data center, selecting an IP flow from the set of IP flows for the application such that the performance of the selected IP flow that is calculated from at least the latency and the data rate associated with the one or more of the plurality of paths satisfies the application-specific performance requirements of the application; and using the gateway of the first data center, establishing a path-optimized connection between the gateway and the other gateway through at least one of the communication nodes of the WAN having the selected IP flow for use by the application.
 14. The method of claim 1, wherein the at least one traffic management scheme implemented by the at least one third-party service provider comprises a content-based traffic management scheme that manages the traffic flow according to different applications that generate the traffic flow. 